PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Mapoly0006s0237.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Marchantiophyta; Marchantiopsida; Marchantiidae; Marchantiales; Marchantiaceae; Marchantia
Family C2H2
Protein Properties Length: 2417aa    MW: 262135 Da    PI: 5.706
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Mapoly0006s0237.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.40.0002223852409123
                           EEETTTTEEESSHHHHHHHHHH..T CS
              zf-C2H2    1 ykCpdCgksFsrksnLkrHirt..H 23  
                           y+Cp+C ++F+  s++ rH r+  H
  Mapoly0006s0237.1.p 2385 YTCPICEQTFRFVSDFSRHKRNtgH 2409
                           9*******************98666 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM005452.4E-131556IPR003349JmjN domain
PROSITE profilePS5118313.911657IPR003349JmjN domain
PfamPF023751.0E-121750IPR003349JmjN domain
PROSITE profilePS5118435.068283470IPR003347JmjC domain
SMARTSM005581.2E-46283470IPR003347JmjC domain
SuperFamilySSF511972.0E-26297473No hitNo description
PfamPF023733.1E-34316453IPR003347JmjC domain
SMARTSM003553523042326IPR015880Zinc finger, C2H2-like
SMARTSM00355523272349IPR015880Zinc finger, C2H2-like
PROSITE profilePS5015712.5723272354IPR007087Zinc finger, C2H2
PROSITE patternPS00028023292349IPR007087Zinc finger, C2H2
SuperFamilySSF576672.21E-1523362394No hitNo description
Gene3DG3DSA:3.30.160.609.7E-1023532379IPR013087Zinc finger C2H2-type/integrase DNA-binding domain
SMARTSM003550.01223552379IPR015880Zinc finger, C2H2-like
PROSITE profilePS5015711.86323552384IPR007087Zinc finger, C2H2
PROSITE patternPS00028023572379IPR007087Zinc finger, C2H2
Gene3DG3DSA:3.30.160.605.2E-823802406IPR013087Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profilePS5015711.32323852414IPR007087Zinc finger, C2H2
SMARTSM003550.1423852409IPR015880Zinc finger, C2H2-like
PROSITE patternPS00028023872409IPR007087Zinc finger, C2H2
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009741Biological Processresponse to brassinosteroid
GO:0009826Biological Processunidimensional cell growth
GO:0033169Biological Processhistone H3-K9 demethylation
GO:0048577Biological Processnegative regulation of short-day photoperiodism, flowering
GO:0048579Biological Processnegative regulation of long-day photoperiodism, flowering
GO:0005634Cellular Componentnucleus
GO:0003676Molecular Functionnucleic acid binding
GO:0046872Molecular Functionmetal ion binding
Sequence ? help Back to Top
Protein Sequence    Length: 2417 aa     Download sequence    Send to blast
MGEMDIAPWL KTLPLAPEFR PTEAEFLDPM AYILKIEEEA RMYGVCKIIP PYNKASKKTV  60
AFHLNRSLAM SRDTLPSSKM HGPCPSMSRS MGGLMAGPQL VKQKSSSLDV SATDSSGKAK  120
FDTRRQQVGW NPKKTRGVAH SQTHKLVWES GEKYTLEQFE QKAKQFSRQR LGTCKDVSPL  180
SVETLFWKAA ADKHSFPVEY ANDIPGSAFA EPLDSSLSLR GKKRKRGLDE PDQGFGMGFR  240
GEESLPIEES PGADEELDGF DSIRLAAGSA EGAGDSGGSA GGKLANSAWN MRNVARSHGS  300
LLRYMPDEVP GVTSPMVYIG MLFSWFAWHV EDHELHSLNY LHTGASKTWY AVPGDAAPAL  360
EEAVRVHGYG GQLNAREVAA VLPTGSTHPT SVSCPAFSLL GEKTTVMSPE VLVAAGVPCC  420
RLVQNAGEYV VTFPRAYHLG FSHGFNCGEA ANFATPGWLD VAKDAAARRA AMNYLPMLSH  480
QQLLYLLTMS LPPRMPASSP SEPRSSRLKV RKKSPGEEMV KNMFVNDVIH NNRLLGVLLD  540
KGVPCCLLAK DAVSHAAAKM PSLEDAEQRL RPADNSLPAS GCLEQVGDCT NLEVAVCSVV  600
QTEDVACPNQ ISSVKLADSS FGMSPSENLL IAEAETNDAF LKVADPCASS EFPGSTTSGL  660
ETKASPYCRN SAVVPSPLAI DWGILPCAAC GILCYATMAV VEPTLTALTT FKALPVKPVR  720
PAVGLSNGKS EAGCDRPESL GLAVDTRASM DLNNGVLVSQ EQAERADVRS EPVDTTGVYG  780
QMTGIGTPPA EAEHAVALKY EILPTQSSVC NFEASLSDDV LSAKNKTPGN TSSLHASELE  840
DAQPLEGRPA LESETGKLEN NGNGRAGSET SPIEVPEAQN AKENVSRGAG SEVHSLETAT  900
QPEVKGSNLL SSLQLLVSTY DEDASDVEGE DFVEEKLEDW EADKGIQVLV RPDSVVGHLI  960
NDVIGLSADV SWQTFNPTTA EMKVISHIST SSPSVFQSVL EGAKNLDSVF TSGGFAFVEE  1020
DRLVDLPTLE RILNEKDSGE CGKKAISDGC SPCSDSFEDP SDALDDEKVD DMQEFYRRIN  1080
SQALTVESQD DELALINLDC PTEFSWQPEP SCSNDVISGA VKCEAETSNS RIKPKNLGAC  1140
HLEPVGREEI GPIKDAASLT SRSMLSPGMN CSRNKISYSA CHPTFEGVLY GRLEEKAFED  1200
KMDCRIHDVS KHLIGDQVSY YSGLPPSFDF VASISRDTEE FNPRTCSSQG RESCPTDQNV  1260
GKAGSEGLGP EKVSQAPIAA VKQRARGGVG RPRVLCLEHA VEAQKRLQDI GGANILIVCH  1320
SGFEDYELRA KDIAQELGIE HTWRDITFSK GSTEEIELVK MAVDVEENDD HGYTDWISQL  1380
GMLVHPRFTT KESEAFDSFK KVTGALRAGP FKKAKPGRPT GSNLMAGRLG ISRKRVDDLK  1440
SYPSSRASVD FEGQLKGSKK KKCMVAGKWC GKVWRVNQVH PLLGGCRSID SSANLGSTSM  1500
SVNISSLSAE ITKLGLTRGL PTIHGPLGSM TTGVEDLQNA AAKKRGRQRK MLEHTKDQVS  1560
EDSSLGPMRN HMVSKTYERK ISEQGKDQVS EDSSGGAVRT IDLVPRNEWK MLEQAKDQGS  1620
EDSSLGPLKV QLVGTSNKAC SAADSEGGQY ASMLQQNLQF KGSYDQEENA DNNKDDCCSP  1680
SNGAEPAGYP GHSNAHGVVP SLSAMTEPTP LAAYDLQTTD DKNKGSSCVL NPPTTQRSAG  1740
VDDSSLTEHS EGVIPLDGDS CALPAATAQT VAPSQSGWQV LSELEQDAVG KHNGDPPDSP  1800
QVHSRGSLTE PQISAPDAFG QQPIVTQSRL PIARASSEPG SLFCSAQHPT KDGGSRAQRI  1860
GKGAAVWCDV PIASSIPWTA GNVTDQAAHP PSSVHPSHNG NAMYFGACSD QAEDTGYDVS  1920
NNVVRPIKGK LSNSFVKTHG LEDSISIEKH QSALKMRKML EFERPQSKRS PADVTSSEEP  1980
VNDVESITTS KMLGWKRKTD GVTANASGKK AKSYDSGKNV KFQEEPVMLA PNNDKTVALI  2040
DEVNNVQELC ESSCHPENEL EQFDYEDDGS QHTLRDIKYV YGVSDSRSEF LAQAEGFAGE  2100
DTSDNHLPGE QIVEDQDSCV PVTMLDSHDY GRQKLQKSRM PYSSRSPPPD DSGQCASGLH  2160
PLDLLPLITS SRGKPLGKTR GKRSKPLKQI WTSLKHPQLE ADKNEGKRDM NSSEAHNIQA  2220
VQCVEARFEA EDCDRGPEIV EHPLGRTSGQ STRLRARVLP VDELDSGDDG DEPKVTAEKG  2280
KKKGRPKKKK VVGRKPKGEE DKEFQCDLDG CRMSFATEAE LGIHKKNRCT RCNKRFFMHK  2340
YLLQHRRVHQ PDRPLKCPWP GCQNAFKWAW ARTEHIRVHT GERPYTCPIC EQTFRFVSDF  2400
SRHKRNTGHT KRKAQE*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4igq_A6e-471645416360Os05g0196500 protein
4igp_A6e-471645416360Os05g0196500 protein
4igo_A6e-471645416360Os05g0196500 protein
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1220226GKKRKRG
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
Representative plantOGRP44011217
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.11e-143C2H2 family protein